Comparison of Cox Model Methods in A Low-dimensional Setting with Few Events

نویسندگان

  • Francisco M. Ojeda
  • Christian Müller
  • Daniela Börnigen
  • David-Alexandre Trégouët
  • Arne Schillert
  • Matthias Heinig
  • Tanja Zeller
  • Renate B. Schnabel
چکیده

Prognostic models based on survival data frequently make use of the Cox proportional hazards model. Developing reliable Cox models with few events relative to the number of predictors can be challenging, even in low-dimensional datasets, with a much larger number of observations than variables. In such a setting we examined the performance of methods used to estimate a Cox model, including (i) full model using all available predictors and estimated by standard techniques, (ii) backward elimination (BE), (iii) ridge regression, (iv) least absolute shrinkage and selection operator (lasso), and (v) elastic net. Based on a prospective cohort of patients with manifest coronary artery disease (CAD), we performed a simulation study to compare the predictive accuracy, calibration, and discrimination of these approaches. Candidate predictors for incident cardiovascular events we used included clinical variables, biomarkers, and a selection of genetic variants associated with CAD. The penalized methods, i.e., ridge, lasso, and elastic net, showed a comparable performance, in terms of predictive accuracy, calibration, and discrimination, and outperformed BE and the full model. Excessive shrinkage was observed in some cases for the penalized methods, mostly on the simulation scenarios having the lowest ratio of a number of events to the number of variables. We conclude that in similar settings, these three penalized methods can be used interchangeably. The full model and backward elimination are not recommended in rare event scenarios.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Survival of Dialysis Patients Using Random Survival Forest Model in Low-Dimensional Data with Few-Events

Background:Dialysis is a process for eliminating extra uremic fluids of patients with chronic renal failure. The present study aimed to determine the variables that influence the survival of dialysis patients using random survival forest model (RSFM) in low-dimensional data with low events per variable (EPV). Methods:In this historical cohort study, infor...

متن کامل

Developing a CoMSIA model for inhibition of COX-2 by resveratrol derivatives

Design of selective cyclooxygenase-2 (COX-2) inhibitors is still a challenging task because of active site similarities between COX isoenzymes. To help with this issue, we tried to generate a 3D-QSAR (3 dimensional quantitative structure activity relationship) model that might reflect the essential features of COX-2 active sites. Compounds in a series of resveratrol derivatives inhibitors with ...

متن کامل

Developing a CoMSIA model for inhibition of COX-2 by resveratrol derivatives

Design of selective cyclooxygenase-2 (COX-2) inhibitors is still a challenging task because of active site similarities between COX isoenzymes. To help with this issue, we tried to generate a 3D-QSAR (3 dimensional quantitative structure activity relationship) model that might reflect the essential features of COX-2 active sites. Compounds in a series of resveratrol derivatives inhibitors with ...

متن کامل

Penalized Estimators in Cox Regression Model

The proportional hazard Cox regression models play a key role in analyzing censored survival data. We use penalized methods in high dimensional scenarios to achieve more efficient models. This article reviews the penalized Cox regression for some frequently used penalty functions. Analysis of medical data namely ”mgus2” confirms the penalized Cox regression performs better than the cox regressi...

متن کامل

Comparison of Ordinal Response Modeling Methods like Decision Trees, Ordinal Forest and L1 Penalized Continuation Ratio Regression in High Dimensional Data

Background: Response variables in most medical and health-related research have an ordinal nature. Conventional modeling methods assume predictor variables to be independent, and consider a large number of samples (n) compared to the number of covariates (p). Therefore, it is not possible to use conventional models for high dimensional genetic data in which p > n. The present study compared th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 14  شماره 

صفحات  -

تاریخ انتشار 2016